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Abstract 

The measurement of thermal fluctuations provides information about 
the microscopic state of a thermodynamic system and can be used in order 
to extract work from a single heat bath in a suitable cyclic process. We 
present a minimal framework for the modeling of a measurement device 
and we propose a protocol for the measurement of thermal fluctuations. In 
this framework, the measurement of thermal fluctuations naturally leads 
to the dissipation of work. We illustrate this framework on a simple two 
states system inspired by the Szilard's information engine. 

1 Introduction 

In his seminal paper of 1929, Leo Szilard was the first to point out the role 
of information in statistical thermodynamics [I]. Recently, experimental and 
theoretical work have specified the relation between information and dissipated 
work in the thermodynamics of small systems [U |3l IH [5j [6l [7l IH |9] . In its tra- 
ditional formulation, the second law of thermodynamics states that the average 
work W needed to change the state of a system in contact with a heat bath is 
bounded from below by the difference in free energy of the final and the initial 
states: 

W > AF. (1) 

In the presence of measurement and feedback during the process, the second 
law has to be extended in order to include the information obtained through the 
measurement and the bound on the work to perform is lowered [71 [H [9l [TOl [11] : 

W >AF- kTI, (2) 

where / is the mutual information of the system state and the measurement 
outcome, k is Boltzmann's constant and T is the temperature of the heat bath. 



A striking consequence of this relation is the theoretical possibility to extract 
work out of a single heat bath during a cyclic process. The second law of 
thermodynamics prohibits such processes. In order to re-establish the second 
law it is therefore essential that the acquisition (and/or processing) of an amount 
/ of information leads to the dissipation of at least kTI of work. 

Landauer and Bennett indeed focused on the processing of the information 
|121ll3j . They argue that, in order to utilize some information, one has to record 
it on some memory device and eventually to erase it. Landauer's principle 
sates, that this erasure step is necessarily accompanied by a minimum amount 
of entropy production sufficient to balance the entropy reduction due to the 
feed-back process. This principle has been the subject of a lot of studies, see 
e.g. [Hill]. 

The aim of this paper is to propose a framework for the modeling of a mea- 
surement device. Under thermodynamically consistent assumptions, the mea- 
surement of thermal fluctuations naturally leads to dissipation in a way similar 
to Landauer's principle. The basic assumptions about the measurement device 
are the following: it should be a thermodynamic system subject to thermal 
fluctuations and it should receive information from the system on which the 
measurement is performed. The first assumption implies that the measurement 
errors should at least include the thermal fluctuations inside the measurement 
device. The second assumption implies that the measurement device is driven 
by the original system. In our framework, it is this driving that is responsible 
for the entropy production inside the measurement device 



2 Modeling the measurement device 

We wish to measure a certain quantity x. Here, x is thought of as a random 
variable distributed according to some probability distribution p{x) . The knowl- 
edge that we have about x is given by the Shannon entropy of p{x) given by 

m-- 

S[p{x)] = -J2p{^)^ogp{x). (3) 

The lower it is, the more information we have about x. By measuring x, we mean 
acquiring some information about a single realization of this random variable. 
Let y be the result of the measurement, distributed according to the conditional 
distribution p{y\x) for fixed x. If we know the value of y, then our knowledge 
about the value of x changes and x is distributed according to the conditional 
probability distribution: 

^^(-1^) - (4) 

where p(y) = '^xPiy\x)p{x) is the marginal distribution of y, i.e. the a priori 
probability to observe outcome y. The entropy of x after observing y is the 
Shannon entropy of p{x\y): 

S[p{x\y)] = ~^p{x\y) \ogp{x\y). (5) 
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This quantity depends on the measurement outcome y. On average over y, it 
is smaller than the Shannon entropy of p{x) given by eq. ([3]), meaning that 
knowing the value of y increases our information about x. The average decrease 
of entropy of x upon knowing the value of y is the mutual information between 
X and y [TB] : 

/ = S[pix)]-Y,Piy)S[p{x\y)] 
y 

^ M p{x,y) 

where p{x,y) — p{y\x)p{x) — p{x\y)p{y) is the joint distribution of x and y. 
The mutual information / is positive and it is zero if and only if x and y are 
independent. 

If X is the microscopic (or mesoscopic) state of a thermodynamical system in 
equilibrium with a heat bath at temperature T, then the information obtained 
can be used to extract heat from the heat bath and convert it into work (TJ 
HI [3 H HI |9]. More precisely, let x be the microscopic state (or micro-state) 
of a thermodynamic system S in contact with a heat bath at temperature T 
and let p{x) be its equilibrium (canonical) distribution. If one knows the value 
of y, then the micro-states of S are distributed according to p{x\y) given by 
eq. Q . This distribution is a non-equilibrium one and exploiting the relaxation 
of S to equilibrium allows one to extract a maximum average amount of work 
linked to the Kullback-Leibler distance or relative entropy of the non-equilibrium 
distribution p{x\y) and the equilibrium one p{x) [11 16]: 

W^M = kTj2p{x\y)\og^^, (7) 

X P^^' 

where k is Boltzmann's constant. On average over y, one obtains: 

W^max = V P(2/) W„,ax(y) = kT I , (8) 



y 



where / is the mutual information between x and y given by eq. 

The measurement device should be a physical system obeying the laws of 
thermodynamics. Moreover, it should receive information from the original 
system S. These considerations lead us to consider the measurement device as 
a thermodynamic system M and the measurement outcome ?/ as a micro-state 
of M. The energy levels of M depend on the value x to be measured in such a 
way, that p{y\x) is the equilibrium distribution of y. Denoting by _Ej\/(y|a;) the 
energy of M when S is in state x and M in state y, we have: 

/ I N f EM{y\x) - Fm{x)\ . 
p{y\x) = exp I — I , (9) 

where Fm{x) = —kTlogJ2y^^Pi^EMiy\x)/kT) is the equilibrium free energy 
of M given x. Every time the value of x changes, M is driven away from 
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equilibrium. Furthermore, we suppose the relaxation time of M to be much 
smaller than the relaxation time of 5', so that M always has the time to relax 
towards the canonical distribution ^ before the value of x changes. 

During the measurement, M is in a probabilistic mixture of macro-states 
[T7] . By this, we mean, that the macroscopic state of M is random. This is so 
because it depends on x which is itself random. The measurement errors come 
form the difficulty to distinguish different macro-states of M upon seeing one 
realization of y. In fact, different p{y\x) for different values of x may overlap, 
meaning that different values of x may be compatible with one value of y. 

3 Measuring the state of a two levels system 

We will start with a simple example inspired by the Szilard's engine [TJ \TE\ in 
order to introduce the measurement protocol, and then we will consider a more 
general case. The aim is to measure the state of a two levels system S in contact 
with a heat bath at constant temperature T, possibly with measurement errors. 
The information obtained through the measurement is then used in order to 
extract some heat out of the heat bath and convert it into work. We denote 
hy X € {1,2} the state of S and hy y £ {1,2} the result of the measurement. 
Initially, S is in equilibrium with the heat bath and the energies of its levels are 
equal so that it has equal probability to be in one state or the other. At some 
point, we measure the state of S with success probability p > 1/2: 

By a cyclic process depending on the measurement outcome one can extract 
at most Wmax = kTI of work, where / is the information gained through the 
measurement [51 1^ [T5]: 

I{p)=p\og^ + {l-p)log^ (11) 

for this specific system. 

The measurement device AI is a two level system as well since the measure- 
ment has two possible outcomes. Its energy levels are separated by a gap linked 
to p by 

AE^-kT\og-^, (12) 
1-p 

so that p{y\x) given by eq. (jlOp is the equilibrium distribution for y. In other 
words: 

f I \ / -kT\ogp + Fm if y = , , 

^"'^y^'^^ = I -A:Tlog(l -p) + Fm if y^x, 

where Fm is the free energy of M . Hence, during the measurement, each macro- 
scopic state of M is labeled by a state of S. 

The measurement process consists of three steps: 
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1. Initially, M is independent of S and S occupies one of its two states with 
equal probability. 

2. At some point M is "put in contact" with S and its energies are switched to 
the values given by eq. (jl3p . It relaxes immediately towards equilibrium, 
so that it is described by the conditional distribution ^TU\\ . Having a look 
at the value of y yields on average information I{p), given by eq. (jlip 
about the value of x. 

3. Finally, M is "decoupled" from S in the sense that it does not anymore 
receive information from it. A protocol fed back with the result of the 
measurement is performed on S yielding a maximum average amount of 
kTI{p) of work. 

The problem is now to find the quantity of work performed on M during this 
cycle of transformations. But before doing so, let us make one short remark. 
It is important that M does not stay in "contact" with S during the whole 
process. If it did, then each time S would jump from one state to the other 
because of a thermal fluctuation, work would be performed on M. Hence, in 
order to dissipate the smallest possible amount of work, it is important to make 
the contact as short as possible. It should be just long enough to yield the 
desired information, but not longer. 

We first need to specify the macroscopic state of M in steps 1 and 3, that 
is, when it is not coupled to S. For the process to be cyclic, we require these 
states to be the same. Consider two possibilities: 

is in some standard state, with equal energies E^j. 

• When M is decoupled from S, it is just left as it is. In other words, it 
is in a statistical mixture of macroscopic states: with probability 1/2 it is 
in a state with energies EM{y\^) and with probability 1/2 in a state with 
energies EM{y\2). 

In the first case, work is performed on M in step [5] and [H i.e. when AI is 
put in contact with S and when it is separated from S. When M is put in 
contact to S, its energy levels are instantly moved from to EM{y\x). The 
averaged work dissipated when one instantly changes the energies of a system is 
given by kT times the Kullback-Leibler distance between the initial equilibrium 
distribution and the final one. This is a special case of the Kawai-Parrondo- 
Brock equality ^9, j20, . We will make use of this relation all along this article to 
calculate the work performed at each steps of the process, the work performed 
when M is put in contact with S is thus given by: 

W^cont = ^ flog + log ^\ + AFm, (14) 

where AFm = Fm — is the difi^erence in the free energy of M before and 
after the contact. When M is separated from S, the work performed on average 
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is: 



Wsep = kT [p\og^ + {l-p) log ^ j - AFm . (15) 

The first term on the right hand side of this equation is the mutual information 
between x and y, eq. (jll[) . Thus 

W,ep-A:TI(p)-AFAf. (16) 

The overall work performed is: 

Wtot = Wcont + W^sep (17) 

= fcr/(p) + ^ flog + log 



2 \ p l-p 

Obviously, the work performed on M is greater than the work that can be 
extracted from S using the information provided by the measurement because: 

D{p) = \ flog ^ + log ^\ > (18) 
2 V P ^-Pj 

with equality if and only if p = 1/2, i.e. when the measurement does not provide 
any information. 

In the second case, work is only performed during the contact since M is left 
unchanged after the measurement. During the contact, with probability 1/2 M 
does not change and no work is performed on it and with probability 1/2, the 
energies of the levels of M are exchanged. The average work performed is: 

W ^^{p\og-^ + {l-p)\og^—^\. (19) 
2 V 1 -P P J 

One can show that W = VFtot, meaning that it makes no difference whether 
one uses a definite standard state or not. In the next section, we will generalize 
this result to the general case of a measurement with an arbitrary number of 
outcomes with an arbitrary distribution. 

As can be seen on figure [H D{p) > I{p). Hence, the contribution of the 
contact step to the dissipated work is greater than the contribution of separation 
step. 



4 Generalization 

The situation is similar in the general case. Consider the setup depicted in 
section O S" is a thermodynamic system in contact with a heat bath at tem- 
perature T and p{x) its canonical distribution. The measurement device is also 
a thermodynamic system in contact with the heat bath and the energy of its 
micro-state either have some standard value E%j{y) or are given by eq. ([T0| . i.e. 
they are such that p{y\x) is the canonical distribution for M for a given value 
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Figure 1: The two different contributions D{p) and I{p) to the work performed 
on M during the measurement process as a function of the probabihty p of a 
successful measurement. D{p) is the contribution due to the contact and I{jp) 
is the contribution due to the separation. 

of X. Since the states of M correspond to the possible measurement outcomes, 
it would make no sense that y can take more values than x. However, formally, 
the following derivation is still valid in that case. 

As in the previous section, we consider two different protocols for the mea- 
surement. Either the energy levels of M are driven from the standard values 
Elj{y) to the values corresponding to the measurement, EM{y\x) and then back 
to the standard values. Or the energies of M initially have the values given by 
the previous measurement, i.e. i?j\/(y|a;') with probability p{x'), x' being the 
state of S during the previous cycle, and are driven to the values corresponding 
to the actual measurement. We will see that for a suitable choice for £'°^(y), 
the two protocols give the same value for the work performed on M, as in the 
case of a two states system. 

We set the values of E\j{y) so that the marginal distribution of the mea- 
surement outcome y is an equilibrium one: 

Elj{y) = -kT\ogp{y)+Fl,, (20) 

where p{y) = Y^xP{y\x)p{x) and is the free energy of M in the standard 
state. If S is in state x when the measurement is carried on, then the average 
work performed on M during the contact is given by: 

H^cont(x) = kry^piy) log + AFuix), (21) 

where AFm{x) = F^ — Fm{x) is the change in free energy of M during the 
process. Hence, on average over x, the work performed is 

VFcont =^p(x)Wco„t(x). (22) 
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The work performed during the separation, conditioned on x is given by 



Ws,p{x) = fcT^p(y|a;)log^^ - AFm{x). (23) 

As in the previous section, the average over x of this quantity is hnked to the 
mutual information between x and y: 

Wsep = J2 P(2^)^Bcp (x) = kTI - AFm , (24) 

X 

where AFm = ^,j;P{x)AFm{x) is the average change in free energy of M and 
/ is the mutual information given by eq. ([6|). The total work performed is the 
sum of the two contributions. It can be rewritten in following form: 

Wtot = kTj2{p{x,y) -p{x)p{y)) log p{y\x). (25) 

Let us now consider the second protocol. The measurement device is left 
untouched after the previous measurement and is thus in equilibrium with its 
energies set to EMiy\x') for a certain x' appearing with probability p{x'). Here, 
x' is the state the system S was during the previous measurement. After the 
contact, the energies of M are set to EM{y\x), where x is the current state of 
S, appearing with probability p{x) (independently of x'). Given x' and x, the 
average work performed on M is: 

W\x',x) = fcTVp(y|x')log^^ + FM(a:) -Fm(x'). (26) 

y p{y\x) 

On average over x' and x, we obtain following expression: 
W' = ^p{x')p{x)W{x',x) 

x' ,X 

= kTj2 p{x')p{x)p{y\x') \ogP^. (27) 

Using the fact that p{y) — J2x' P{y\^')pi^')^ o^i^ can bring the above expression 
to the same form as eq. (l25t : 



W = kTj2 {P{x,y) ~ p{x)p{y))\ogp{y\x) = W^ot- (28) 



x,y 



This extends the result obtained in the previous section for a two states system. 
It means that switching M from a random state to another, or switching it 
from the standard state to a random state and then back to the standard state 
involves the same quantity of work on average. 
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5 Relation to Landauer's principle 



In its original formulation, Landauer's principle states that erasing one bit of 
information is necessarily accompanied by the dissipation of at least kT log 2 of 
work [12j . Bennett used Landauer's principle to propose a solution to Szilard's 
paradox |13| . Let us briefly sketch the argument behind Landauer's principle. 
Consider a one particle gas in a closed container in contact with a heat bath. 
The volume is divided in two by a removable partition. If the particle is in 
the left half of the container, it encodes one value, say "0" and if it is in the 
right half of the container, it encodes the other value, "1" in this case. Erasing 
the information contained in the memory means bringing the memory to a 
standard state, say the state "0", without knowing the value that is encoded. 
Such a protocol need to bring both states "0" and "1" to "0" . An obvious way to 
proceed is to remove the partition and then compress the gas in the left half of 
the container. Such a process dissipates at least A:Tlog2. The dissipation occurs 
during the free expansion of the gas right after the removal of the partition: the 
volume of the gas is doubled, i.e. its entropy is increased by fclog2 and no 
heat is exchanged with the heat bath meaning that the entropy of the latter 
stays constant during this process. Compressing the gas in the left half of the 
container needs kT log 2 of work which is then transferred to the heat bath in 
form of heat, thereby increasing its entropy by k log 2. This result was extended 
to general memories, see e.g. [15] . 

If the bit was used to store the result of the measurement of the state of 
a two levels system, as in section [31 then the work needed to erase the bit 
surely compensates the maximum work that can be extracted with help of the 
measurement because /(p) < log 2 with equality if and only li p — 1 (or p = 0), 
i.e. when there are no measurement errors. In [13 , Bennett argues that the 
measurement can be performed reversibly. By this, he means that as long as we 
know in which state the memory is (say in the state encoding "0" , the standard 
state), we can reversibly drive it to "0" or to "1" according to the result of the 
measurement. We will call this process "recording of the information" rather 
than " measurement" . However, this argument implies that the system S does 
not evolve during the recording, so that the information is still useful when the 
process of recording is finished. This is not the case in general. Consider for 
instance the (imperfect) measurement of the position of a Brownian particle as 
presented in [3]: once one has measured the position of the particle, one should 
immediately perform a process depending on the measured position in order 
to convert all of the information into heat. In fact, the particle does not stop 
moving after the measurement. So if one takes an infinite time to reversibly 
record the information before using it, then once it will be recorded it will bring 
nothing anymore because the system will have relaxed back to equilibrium. 

In our framework, the information about the result of the measurement is 
stored in a way similar to the situation considered in Landauer's principle: the 
measurement device is in a probabilistic superposition of macroscopic states, on 
per value of x and appearing with probability p{x). However, there are two im- 
portant differences. We do not only consider dissipation during the erasure step 
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but also during the measurement itself. As argued above, reversible recording 
of the information implies that the fluctuation about which the information is 
recorded is "frozen" . If instead the system is still in contact with the heat bath 
and continues to evolve, then the driving of the measurement device needs to be 
fast. However, the main reason for the instantaneous changes in the energies of 
M is that it should be directly driven by S: either the energies have a value that 
directly depends on x or not. But we consider no in between. In our opinion, 
this is what makes the difference between "measurement" and "recording" . We 
consider "recording" to be the following process: we know the outcome of some 
measurement and we want to record it to some memory device. This can be 
done infinitely slowly and hence with an arbitrarily small dissipation. On the 
other hand, a "measurement" device is directly driven by the information which 
is measured, i.e. we do not have a direct control on it. What then makes the 
information utilizable is the fact that M relaxes instantly. If M is not fully 
relaxed, then the information obtained is less, but the work performed is the 
same. 

Finally, let us remark that Landauer's erasure step is analogous to our "sep- 
aration" step. During this step, the measurement device is brought from a mix- 
ture of macro-states to the standard state similarly to Landauer's bit, which is 
brought from a statistical mixture of "0" and "1" to the standard state "0". 
And in fact, the work dissipated during this process is fcT/, i.e. it exactly 
compensates the work extracted from the heat bath using the information /. 
One big difference however, is the presence of measurement errors. As already 
mentioned, in the situation considered here, unlike in the classical situation 
usually involved in Landauer's principle, the different p{y\x) may overlap for 
different values of x. Sagawa and Ueda extended Landauer's principle to the 
situation with measurement errors [TS]. But in their framework, the measure- 
ment errors consist of an erroneous recording of the information and the different 
macro-states of the memory encoding the different measurement outcome are 
still perfectly distinguishable. 

6 Conclusion 

We have presented a very simple model for a measurement device and a protocol 
for the measurement of thermal fluctuations. The basic considerations motivat- 
ing this model are the following: the measurement device should be a physical 
system and should obey the laws of thermodynamics and its state should depend 
on the value which is measured. In particular, the measurement errors should 
at least include thermal fluctuations. In that respect, the model presented here 
is minimal: the measurement errors are only due to thermal fluctuations. We 
showed that under these assumptions, the measurement process itself already 
leads to dissipation of work in addition to the dissipation due to Landauer's 
erasure principle. 

We also showed that the work performed on the measurement device is the 
same, whether the measurement device is intialized in and eventually brought 
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back to a standard state, or is simply left as it is at the end of the measurement 
process. 
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